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-TITLE FALMAN = Cluster Failover Manager 
-IDENT ‘V04-000' 


~PARARARBABAALAAASAAALALLALEALE SSSA RASA SALES EAA E SARA AERA SSSA RSE A RS ER RRA AERA SS 


COPYRIGHT (c) 1978, 1980, 1982, 1984 BY 
tie ™ ae eS CORPORATION, MAYNARD, MASSACHUSETTS. 


HED UNDER A LICENSE AND MAY BE USED AND corte 
H THE TER an OF SUCH Tat AND WITH THE 
U F THE ABOVE SAC TDER OTICE. THIS SOFTWARE OR ANY OTHER 
E BE PROVIDED OR OTHERWISE MADE AVAILABLE TO ANY 
R O TITLE TO AND OWNERSHIP OF THE SOFTWARE IS HEREBY 


;* * 
tt + 
+e * 
* * 
sf x 
;* *® 
.f ® 
* * 
* ® 
:* * 
;* TRANSFERRED. 7 
: * 
* > 
ft x 
sf * 
ft * 
tt * 
* * 
tt * 
* * 
* * 


PAN 


L RIGHT 
THIS $O RNI 
ONLY IN em ie WI 
INCLUSION 
COP! 
OTHER PER 


THE INFORMATION IN THIS SOFTWARE IS SUBJECT TO CHANGE WITHOUT NOTICE 
AND SHOULD NOT BE CONSTRUED AS A COMMITMENT BY DIGITAL EQUIPMENT 
CORPORATION. 

DIGITAL ASSUMES NO RESPONSIBILITY FOR THE USE OR RELIABILITY OF ITS 
SOFTWARE ON EQUIPMENT WHICH IS NOT SUPPLIED BY DIGITAL. 


; RATATAT AATAAAAACEARAAAEARAAEARAAAARAAAARAAAAARAEAAARAARAAAAETEARAAEAAEEEAEE 

++ 

; FACILITY: EXECUTIVE, CLUSTER MANAGEMENT 

; ABSTRACT: 

: This module contains the routines that direct failover in a 
VAXc luster. 

; ENVIRONMENT: VAX/VMS 

; AUTHOR: David W. Thiel, CREATION DATE: 24-May-1983 

; MODIFIED BY: 


: V03-014 DwT0225 David W. Thiel 11-Jul-1984 
3 Change call to temporary name EXESMNTVERSP2 to 
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V03-013 DWT0222 David W. Thiel 25-Jun-1984 
Revise CNXSFAILO_ SYNC te aintnise time delays 
= synchronizing failover steps. 


v03-012 DWT0220 David W. Thiel 8-May-1984 
Correct previous fix to handle reference to Zyabol 
outside of this image. 


vO3-011 DWT0219 David W. Thiel 8-May-1984 
“fe syachrentzat ton between 1/o and lock menseer 
clusters. 


v03-010 DWT0188 David W. Thiel 99-Mar-1984 
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Correct synchronization routine to correct 
thread when it is discovered that another f 
has begun. 


v03-009 $RB0110 Steve Beckhardt 22-Feb-1984 
Added more failover table entries for improved 
consistency checking. 


V03-008 DWT0154 David W. Thiel 29-Dec-1983 
Use single table for addition and removal of nodes. 
pnenge names of CNX$BEGIN_FAILIN and 
CNX$BEGIN_FAILOVER to CNXSMEMBERSHIP_CHANGE. 
Remove obsolete failover table entries and ; 
CNXSFAILO_QWAIT routine. 


V03-007 DWT0137 David W. Thiel 07-0c t-1983 
Differentiate cases of a certain future failover 
(based on existence of a broken connection to a 
cluster member) and a pending failover (node has 
been failed out, but failover table processing 
has not begun due to the state of a previous 
instance of failover table processing) using the 
CLUBSV_LOST_CNX bit. 


V03-006 DWT0121 David W. Thiel 20-Aug-1983 


v04-00 Page é FAI 
ALMAN.MAR; 1 (1) 
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Add failover table entry to start journal recovery. 
V03-005 DWT0116 David W. Thiel 11-Aug-1983 
Correct Bynenrand ogy ten logic. Correct quorum 
waiting after a failover. 
v03-004 DWT0111 David W. Thiel 27-July-1983 
Add fail-in table. 
cenvers interface to failover routines from JMP to 
v03-003 SRB0095 Steve Beckhardt 9-Jun-1983 
Add entries to failover table. 

00 v03-002 DWT0104 David W. Thiel 8-June-1983 
0 Add failover routine CNXS$FAILO_QWAIT that waits for 
0 quorum before soapnet ing. 

Add CNX$CHECK_FAILOVER to test for and initiate 
a pending failover. 


v03-001 DWT0103 David W. Thiel 27-May-1983 
Add index definition argument to FSTEP macro, 
ae acket routines for calling existing failover 
routines. 
Avoid trying to send message to a local node. 
Fix logic error in synchronization code. 
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CORP offsets 
Cluster message definitions 


PDEF 
GDEF 
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FRR AAA AERA AAA AAA AAA AAA AAA AAA AAA AA RARER AEE eS 
. 


: NOTE: The following assumptions are in effect for this entire module. 


° 
FRA AERA AERA AEA AAA AAAAAAAAAAAAAAARAAKAAEE AeA eee 


ASSUME IPL$_SYNCH EQ IPL$_SCS 
ASSUME IPL$"SYNCH EQ IPLS_TIMER 


113 
114 
! 5 
1 : SCOR 3 
1 SCLSMS 5 
1 SCLUBDEF ; CLUster Block offsets 
120 SCSBDEF ; CSB Offsets 
121 SFKBDEF 3: Fork bloc offsets 
0 1 § SIPLDEF 3; IPL definitions 
° ! Z SSSDEF ; Status code definitions 
125 ; 
8 1 § ; MACRO DEFINITIONS: 
0000 1 : 
0000 128 
0000 129 «MACRO FSTEP ADDR,STEP_INDEX,?LABEL 
0000 130 ~SHOW BINARY 
0000 131 ; Self relative address 
0000 1 ¢ LABEL: .LONG ADDR=-LABEL 
0000 1 -IF NB STEP_INDEX 
0000 §=134 ~SHOW EXPANSIONS 
0000 135 STEP_INDEX= <LABEL-CNXSFAILOVER_TABLE>@<-2> 
4 136 -NOSHOW EXPANSIONS 
000 137 ~ENDC 
0000 138 »NOSHOW BINARY 
444 } H ~ENDM = FSTEP 
4 141 seheneaneeovessers TEMPORARY DEF INI TIONS ¢ #eeeenane 
1 3 
0000 =1 5 -ITIF NOT_DEFINED CLUFCBSV_WAITING, CLUF CBSV_WAITING= CLUF CB$V_FKB_BUSY+1 
8 3 : 7: -1IF NOT[DEFINED CLUFCBSM_WAITING, CLUF CBSM_WAI TING= TaCLUF CBSV_WAITING 
0 1 § | gadadanesandsedboeeneisiaaaetbesnebeeurienetboenenbeneseeneeenseneeses 
0 1 
0 148 
8 0 149 
3 150 
0000 182 
te 
154 
0000 155 
0000 156 
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FUNCTIONAL DESCRIPTION: 


The failover table Lists the sequence of steps in failing nodes 
out of a cluster. Each table entry is the self-relative address 
of a routine that performs one step. 


if any failover step breaks the thread of IPL SYNC execution, it 
s possible that oneener failover may be needed at that point. 
The way in which this is handled depends on a mode of failover 
table processing. In NOREFAIL mode pend: tne of a subsequent 
failover until either this mode is left or the end of table 
processing is reached. In REFAIL mode, this failover is 
abandoned and processing of the table entries is reinitiated 
from the beginning. 


The table is processed as follows. Each node independently 
executes pe steps in the failover table, synchronized only at 
the begin ng and when specific requests for synchronization 
are made. Control of synchronization and other aspects of 
failover processing are themselves controlled by table entries. 
These special tables entries are described next. 


CNXSFAILO_NOP == No Operation 

NOP entry. Used to reserve space for patching. 
CNXSFAILO_END == End of Table 

Marks the end of the table and terminates table processing. 
CNXSFAILO_REFAIL -- Set Re-failoverable Mode 


Specifies that this failover may be abandoned at any time and 
another begun. In this mode, any suspended thread must save 
the failover identification and validate it upon being 

resumed. If the ID has changed, the thread must be terminated; 
otherwise, it may proceed. 


CNXSFAILO_NOREFAIL =~ Clear Re-failoverable Mode 


Specifies that this failover mey not be abandoned and that 
no new failover may be begun. While in this mode, failover 
rout ines MAY NOT suspend themselves without a guarantee of 
resumption. In particular, a routine MAY NOT wait for a 
message to arrive from the corresponding routine on another 
note. Furtneraere, a routine MAY NOT wait for process 
execution. 


CNXSFAILO_SYNC == Synchronize Failover Steps 
Requests synchronization at this step, One node is selected 
e 


9s the synchronizer when failover ta processing is 
nitiated. When this table entry is encountered, every 
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node sends a READY message to the peasroneres node. When 

the synchronizing node executes this entry, it waits for a 

READY from every other node involved in the failover and 

then sends a DOSTEP message to every involved node. It 

one zoe Sonat nodes then complete this step and proceed to 
e next. 


<= 

mw 
2D @& 
re 
mo 


Following a synchronization step, it is guaranteed that every 
other anyoty d node has completed the step preceding the 
; nchronization step. Due to delays and queuing of mensoent, 
is still possible to receive a message from another node 
that was queued before the synchronization step. 
Requesting a synchronization step ALWAYS puts the failover 
into Re-Failoverable Mode. This results in the very substantial 
simplification which allows any communications problems during 
synchronization to be handled as part of a subsequent failover. 
The following description applies to every failover routine: 


CALLING SEQUENCE: 
Invoked by: 
JSB failover_routine 
IPL is IPL$_SYNC = IPL$_SCS = IPLS_TIMER 
To continue failover processing, return with: 


To abandon failover gresesstag in REFAIL mode: 
ADOL #4,5S 


RS 
To aurnine te failover processing in either mode: 
JMP CNXSEND_FAILOVER 


A failover routine runs as a fork process and must 
behave according to the general rules for fork processes. 


INPUT PARAMETERS: 
04(SP): Address of caller's caller -- return here if thread 
is suspended 
OO(SP): Return here to continue failover processing 
R5: Address of CLUster Failover Control Block (CLUFCB) 
R4: Address of CLUster Block Sag 
R3: Failover ID (copy of CLUFCBSL_ID(R5)) 
OUTPUT PARAMETERS: 
NONE 
COMPLETION CODES: 
NONE 


; SIDE EFFECTS: 
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5 Registers RO-R5 need not be preserved. gotten J interesting 
: that a failover does is a side effect of some sor 

-PSECT $$$060,LONG ; Read only Data PSECT 
CNXSFAILOVER_TABLE: 
FSTEP ENXSE ALLO NOREFAIL t be first entry 
CONG CNXSFAILO _NOREFAIL- 300008 
FSTEP ERSTALL ALL al k requests 
Nitth “CONG LcKssTaut Btie iat . 
: Everything up to this point must be synchronous with the beginning 
; of failover table execution. Note that the beginning of failover 
3; table execution is not necessaril synchronous with the receipt of 
3; the phase 2 message in the case when a failover is already in 
3 progress. 
: FSTEP NXSF ALLO REFAIL recede synchronize 
$500 CON NG CNXSFAILO erat Sh0028 
FSTEP NXSEALLGS NC synch nize 
or ONG CNXSFAILO_ 5 N¢~3000 $ 
FSTEP eyei nits NOREFAIL “Don't allow 0 -failovers 
0004$: .CONG CNXSFAILO. NOREFAIL- 300045 
FSTEP gNXs10- SYNC : rialize 1/0 
000 ONG CNx$10_ SYNCH 300088 
FSTEP CKSSET_STATE1 ; Set rebuild state 1 
0006$:~.LONG LCKSSET_STATE1-30006$ 
FSTEP RASPAILO REFAIL recede synchronize 
$500 CONG CNXSFAILO_ rerart $6007s 


FSTEP NXSP ILO SYN Ri chro 
0008 sCONG Aguxsearvo. s Nc 300088 

FSTEP CUNSEAILO L t Lox re-failovers 
30009$: ONG PATA YSFAILO _NOREFAIL- 0009S 


FSTEP ,ersinl NIT REBUILD ht master copy locks 
01 TLONG LCKSINIT. ‘REBUILD ~30 
7 and tresreed entries 


FSTEP  CNXSFAILO_REFAIL 
ester Feveeniuav tree CNRS ALLO, REFAL 
e 
00128: "CONG CNXSFAILO_ Sing 500138 


aif $58ifs synchronize 


FSTEP NX$ AILO_N OREF AIL allow re-failovers 
$b01 S$! “CONG CNXSFAILO _NOREFAIL= - 300188 
FSTEP K$s ’ _STATE2 ild state 2 
bool .LONG LCKSSET bra Teo 880148 
FSTEP BASE AILO REFAIL de synchronize 
DOSS. CONG CNXSFAILO_REFAS, <S00188 


FSTEP CUXSPRILO- SYNC 3 Synchronize 
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300168: sLONG CNXSFAILO ~$YN¢~300168 
CNXSFAILO NOREFAIL ot Loy Cgetatlovers 
0017$: .CONG CNXSFAILO. NOREFAIL= 0017$ 


BUILD_LKBS build locks 
Boo tone Lcxsresuiip Leas 300i 8s 


tOCCHG CNXSFAILO_ neFart-S66%9s aries 


SYNC e 
. CONG CNXSFAILO 347380268 
IL6 NOREF AIL how fea restovers 
CONG CNXSFAILO. poner agi 2021S 


A 
$: 

A 

$: 

A 

$: 

ET_STATE3 id state 3 
$:".LONG LCKSSET _braTEs¢$00138 

A 

$: 

A 

$: 

A 

$: 

E 

$: 


i 
t OCONG CNXSFAILO_ ReFait-So6ess seg e 


SYNC synen ronize 

. CONG CNXSFAILO_ 5 N¢=300248 
ILO NOREF AIL “Don L tgs re-failovers 
“CONG CNXSFAILO _NOREFAIL- 0025$ 


BUILD_R uild_ resources, grant 
: Tone **Lexsresuiid. *RSBS_30026% 
3 unprotected locks 


NSF AILO REFAIL r 
$b0> CONG CNXSFAILO_ nears 300 $55 
SYN synen cont ze 
00e8s CONG CNXSFAILO_ SYNC=30028$ 
NX FAILONN OREFAIL “Don't allow re-failovers 
0029$: .CONG CNXSFAILO. ROREFATL © 300098 


CKSSET_STATEO rebuild state 0 
0030$:".LONG LCKSSET_ ira TEoe 0030$ 


CNXSEAILO REFAIL 

00318: CONG G cnxse Aico feraite$ 
e 

SOD SSS CONG. CNXSFAILO. SYNE 3O0SSE 


synchronize 


aif $568fs°¥renron'2* 


SGO8SS: “LONG LcKSRESUME UNPROTSSOOSSS ished es 


3 erage Locks 


NXSFAILO_JNL SP/R 
b086$. CONG CNXSFAILO. ca -30034$ 
FA 


ILO_SYNC ronize 
SOOSSS CONG, CNXSFAILO. SYNE~S003SS 


LeOkes CONG. LCKSRESUME wAiteSoBSe5°**** waiting for any 


3; «type of lock request 
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4 349 FSTEP NXSFAILO_NOREFAIL : Don't all -failovers 
00000010' i 0 SOOS7Ss CONG. CNXSFAILO_NOREFATL=S00375 7 
FSTEP KSREBUILD _RSBS ; Rebuild resources, grant 
FFFFFFOB' 38 bob Sa8. LONG LCKSREBUILD RSBS-30038$ . 
ae 338 3 protected (all) locks 
9C 4 FSTEP NXSFAILO_REFAIL,FORCE_END 3;_Endin int 
00000003" 009 GOOSOS CONG ENXSFATEO. REFAIL= 300908" °° 
+ ~SHOW EXPANSI 
00000027 A FORCE END= <30039$-CNX$FAILOVER_TABLE>@<-2> 
BA 355 FSTEP NXSFAILO END ; End gf fap e 
00000017" Ad 0040$: .CONG CNX$FAILO_END-30040$ 
000000C4 o¢t 3} ; -BLKL 8 3; Reserve patch space 
(4 359: 
0000 44 ¢ -PSECT $$$100,LONG 
itd 62 -DEFAULT DISPLACEMENT , WORD 
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-SBTTL CNXSMEMBERSHIP_CHANGE - Begin membership change actions 


FUNCTIONAL DESCRIPTION: 
——— ERSHIP_CHANGE 
This routine is called whenever a node or nodes are added to 
er removed from a cluster, inc Luding initial cluster formation. 
sone are under tan by the aonyere af the cluster to 
adj yst o the new cluster membershi P: If an uninterruptible 
s already in progress, handling of the new event is deferred. 
CALLING SEQUENCE: 


JSB a iat ana CHANGE 
IPL is IPL$_SCS 


INPUT PARAMETERS: 
NONE 
OUTPUT PARAMETERS: 
NONE 
COMPLETION CODES: 
NONE 
SIDE EFFECTS: 
RO-R5 are destroyed. 


SOCSCSOCOCSOOOSOOSOSOOSOOSOOOOOOOOOOSOOOOOOOOOOOOOCOoOOo 
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-ENABLE LSB 
CNXSMEMBERSHIP_ CHANGE: : 
54 0000000'GF 00 1$: MOVL “CLUSGL_CLUB,R4 ; Address of CLUB 
5 010C C4 4 CLUB$B cCUFCB (RG) .R5 ; Address of oe ae control block 
05 20 a5 00 € BBC acl UF FCBSV_ACTIVE ; Branch if no active failover and 
CLUF CES ASTATUS AS) , 108 : . set active failover flag 
20 AS 02 C8 BISL2 #CLUFCB 3; Set failover pending flag 


CLUFCBSL. STATUS(RS) 
RSB ; Return, will do failover later 


10$: BICL2 #<CLUFCBSM_PENDING ! - ; Clear failover pending flag 
CLUFCBSM~SYNC_NODE>, -; and local synchronizing node flags 
CLUF CBSL STATUSTRS) 
MOVL CLUBSL_LST_X TNeRGS. ° 


CLRL 
MOVL 


C 

R4) 

CLUFCBSL_ SYNC. CSBC(RS) 

CMPL CL ; Was this node the failover coordinator? 
chy SCTLOCAL. CSB tRA) 

BNEQ ; Branch if n 


BISL2 #CLUFCBSM_SYNC_NODE, - : Make this nade the failover sync node 


20 AS 06 «(CA 


FAM $F NOQOQOOOCOCOCOCOSOSCSCSOSOOCOSCOOCOOOOOOOOOOOOOOOoO 


1 AS 634 A4 «(=O 


18 AS 4 
2405 5C A DO 


10 Ad «=5C ASOT 


; Make last transition ID the failover 

: identifier 

; Initialize failover step 

; Set up CSB address of synchronizer node 
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CLUFCBSL_STATUS(RS5) 
PUSHL R ; Save CLUFCB address 
ASSUME SaLUE CESS NODEMAP+CLUF CB$S_NODEMAP> EQ CLUF CBSB_RESPMAP 
MOVCS #0,(SP),# . 


’ » © ear node map and response map 
#cLUFCBSS NODEMAPSCLUF CBSS_RESPMAP, - 
CLUFC ) 


‘itt 
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28 AS «6000460 8F «600 6E Sis 
B_RODEMAP(R 


20 BA POPR > 3; Restore CLUFCB address 
free BSBW CNXSSCAN_CSBS ; Iterate over all CSBs 
24 E BLBC $ ; Branch when done 
OE 60 AS O01 €E1 BBC ac ; Branch if CSB is not for a member node 


09 60 AS 00 «£0 


AS 3 


BBS #cs LON cee ; Branch if connection permanently broken 
50 64C 
00 28 AS) 6550)—CO'E 


C 
MOVZWL CSBSW-CSID_IDX(RS) ,RO ; CSID index of member node 
BBCS RO, = 3; Mark node present 

CLUF CBSB_NODEMAP(R5) ,30$ 


PREP RRO © 


30$: RSB 


; Failover routine may jump here to exit failover processing 
ENXSEND_FAILOVER: : 
MOVL 


54 pooonege et dO G*CLUSGL_CLUB,R4 ; Address of CLUB 
55 010C C4 QE MOVAB CLUB$B_CCUFCB(R4) ,RS ; Address of failover control block 
18 AS 27 00 MOVL #FORCE_EN ; Set index to force end of table processing 


D - 
CLUF CBSL_STEP(RS) 
: Begin a failover step 


“ 

“ 

o 4 

wn .~/ 
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53 1C AS) DO 40S:  MOVL  CLUFCBSL_ID(R5) ,R3 ; Failover ID 
50 18 a5 DO MOVL  CLUFCBSL~STEP(RS),RO _: Get index of next failover step 
50 Q000'CF40 ODE MOVAL W*CNXSFATLOVER_TABLECROJ.RO ; Get table entry 
: R5: Address of failover control block (with available fork block at head 
: R4: Address of CLUB 
3 R3: Failover ID 
00 B040 16 : JSB a(r0) CROJ ; Convert self relative to absolute address 
54 Q0000000'GF D0 MOVL G*CLUSGL_CLUB,R4 ; Address of CLUB 
55 010C C46 9E MOVAB CLUBS$B_CCUFCB(R4) .R5 ; Address of failover control block 
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to continue this failover, start again at the beginning of the fable, or drop ever 
: saoutng that a new failover will begin soon (the Last case is assured if a connect 
uster member is broken). 


BBS #CLUFCBSV_ACTIVE, - ; Branch if new failover may not 
CLUF CB 5 STATUS (RS) ,508 ; commence here 

BBS #CLUFCBS9_PENDING, - ; Branch if another failover is pending 

CLUF CBSL STATUS(R5) ,60$ ; and initiate it 


CD CDCD 6909 C9 SINISE AA AAA A MMIII 


; Use the bits CLUFCBS$V_ACTIVE, CLUFDB$V_PENDING, and CLUBSV_LOST_CNX to decide whet 


s toac 
0A 20 AS 00 €0 7 
0A 20 a5 01 €0 


08 1 AS «=617~—COEO 


Sooo 
OO Occ c 
239 OO NNNNNN 
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BBS Het v_COST_CNX, = Branch if a connection has been lost and 
CLUBSL_FLAGSTR4) ,70$ and new failover will soon happen 


18 AS 06 $8 508: INCL CLUFCBSL_STEP(RDS ; Advance to next failover routine 
cE a BRB 4 ; Do next step 
FF62 = 31 98 60$: BRW 1$ ; Start a new failover 
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rAnan - Shuster Failove 
v04-000 CNXSMEMBERSHIP_CHANGE = 
05 O03 $78 70$: RSB ; Drop thread -- another failover will come 
oF 186 - DISABLE LSB 
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-SBTTL CNXSFAILO_REFAIL = Allow new failovers to start 
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FUNCTIONAL DESCRIPTION: 


Enter REFAIL mode in which new executions of the failover 
table may be initiated. 


CALLING SEQUENCE: 


JSB CNXSFAILO_REFAIL 
IPL is IPL$_SCS 


INPUT PARAMETERS: 


R5: Address of Failover Control Block 
RS: Faflover Ib 
OUTPUT PARAMETERS: 
NONE 
COMPLETION CODES: 
NONE 
SIDE EFFECTS: 
NONE 
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CNXSFAILO_REFAIL:: . 
BICL2 #CLUFCBSM_ACTIVE, - : Clear active bit 
ial CLUF CB$L_STATUS(RS5) 
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-SBTTL CNXSFAILO_NOREFAIL = Prevent second entrance to failover table 
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FUNCTIONAL DESCRIPTION: 

Stop allowing new failovers to begin. 

Terminate any previous failover that may have been in progress. 
CALLING SEQUENCE: 


JSB CNXSFAILO_NOREFAIL 
IPL is IPL$_SCS 


INPUT PARAMETERS: 


R5: Address of sailover Control Block 
fas Pelisce hee 
OUTPUT PARAMETERS: 
NONE 
COMPLETION CODES: 
NONE 
SIDE EFFECTS: 
NONE 


CNXSFAILO_NOREFAIL:: 
BBSS #CLUFCBSV_ACTIVE, - ; Set active bit and branch 
CLUFCBSL_STATUS(R5S),10$ : if it was already set 
pace #CLUFCBS$V_FKB BUSY, - 
CL FCBSL_STATOS(R5S 108 : and mark it free 


; Remove from queue 


10$: 


oe 
BICL 


RSB 


( 
#CLUFCBSM_WAITING, - 
CLUFCBSL_STATUS(RS5) 


; 5 
3; Branch if fork block free 


; Clear waiting bit 


13 
(6) 
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: § -SBTTL CNXSFAILO_NOP = Null failover routine 
Be 
8 f 3 FUNCTIONAL DESCRIPTION: 
0B 72 ; Null failover routine. A legitimate failover routine that does 
44 Y : absolutely nothing. 
086 f3 ; CALLING SEQUENCE: 
oes % : JSB CNXSFAILO_NOP 
Oee a : 1Pe is IPL$_SCS 
086 p : INPUT PARAMETERS: 
Boer 2 ¢ : RS: Address of Failover Control Block 
086 ; R4: Address of CLUB 
Boee age 3 R3: Failover ID 
0086 386 ; OUTPUT PARAMETERS: 
i me 
bone 590 : COMPLETION CODES: 
00B6 591; 
ee i; we 
0B6 594 : SIDE EFFECTS: 
0B6 595; 
00B6 596; NONE 
0086 597; 
ae 
00B6 600 CNX$FAILO_NOP:: 
05 0086 601 hy) 
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-SBTTL CNXSFAILO_END = End failover table processing 
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FUNCTIONAL DESCRIPTION: 


End failover table processing. This assumes that REFAIL 
mode is in effect. 


CALLING SEQUENCE: 


JSB CNX$F AILO_END 
IPL is IPL$_SCS 


Pt ttt pss ro) 
pesdecdesdssdesdesdeslestecleclecdestecleclecdvsTesTesTecslestosleslesluclastecleslecleslecleclesleclec[e:]e: IMME gat. ) 


r 
0 
7 603 
7 604 
4 605 ; 
f 809 : 
7 88 : 
f 610; 
r.. @13-3 
7 916 3 
? 615 3: 
7 614; 
ee 
0 , ei? > INPUT PARAMETERS: 
0B7 619: RS: Address of Failover Control Block 
087 og : R4: Address of CL 
° 4 ? = R3: Failover ID 
9 , 6 : OUTPUT PARAMETERS: 
087 é 5: NONE 
0B7 6 3 t COMPLETION CODES: 
0B7 628; 
7 ei 
0 u 631 : SIDE EFFECTS: 
0B7 6 : : NONE 
OB7 634; 
OB? 636 
087 $39 CNXSFAILO_END:: 
5E 04 CO O0B7 638 ABDL2 #4,5P 3; Remove caller's address 
05 OOBA a7. RSB ; Return to caller's caller 
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-SBTTL CNXSIO_SYNCH = Synchronize with I/0 cluster 


+ 
aa 


FUNCTIONAL DESCRIPTION: 
Synchronize with the 1/0 cluster. 


Serialize 1/0 to multi-host disks to ensure that I/0 issued under current lo 
completed before 1/0 issued under locks granted after some locks have been r_| 
as the result of removing nodes from the cluster. 
Note that it is unnecessary to do this operation if nodes are being added an | 
are being removed. 


This entry is called following a synchronization call to ensure that all nod 

have disconnected from a node being removed before this call is made. It is. 

important that the node being removed has either lost quorum or crashed and | 

therefore ceased to issue I/0's before this call is made. | 
CALLING SEQUENCE: 


JSB CNX$10_SYNCH 
IPL is IPL$_SCS 


INPUT PARAMETERS: 


0088 RS: Address of Failover Control Block 

0088 R4: Address of CLUB 

0088 R3: Failover ID 

0088 

ios OUTPUT PARAMETERS: 

0088 NONE 

3288 

0BB COMPLETION CODES: | 
088 
0BB NONE 


SIDE EFFECTS: 
Disks are thrown into mount verification. 
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CNX$10_SYNCH: 
JMP 


00000000'GF 17 G*EXESCLUTRANIO : Synchronize with 1/0 cluster and return 
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~SBTTL CNXSFAILO_SYNC = Inter-Node Synchronization Failover Routine 


<< 


NNO OOOOOOOO Z-- 


Ot ed 8 ad ad ad md > —B QHOODODOOOOOOOOOOOONOD O- 


ODNAULS WO OD NAU EW ODOM SE wr 


won 


* 
+ 


FUNCTIONAL DESCRIPTION: 


This routine synchronizes failover action over all involved nodes. 
This failover routine does not complete until all nodes have completed 
the previous failover step. 

A new failover may be begun while this routine is in progress. 


The synchronization algorithm is a follows: 


One node is chosen as the node to drive the synchronization. ALL other 
nodes send a message to the sync node when this routine is executed. 

If the sync node NAKs the message, the node waits and tries again. If 
the message fails, the node drops its thread, confident that a new 
failover will begin sometime and start the whole thing over. If the 
message succeeds, the node then waits for a DOSTEP message, upon 
receipt of which it exits from this failover step. 


Meanwhile, the sync node waits for all other nodes to report in. When 
a message is received for the right failover, it is noted and ACKed. 
If a message is not for the current failover, it is NAKed. After all 
nodes has reported, a DOSTEP message is sent to every other node. 

When a connection breaks the CNXSCON BREAK routine sets a bit that 
eliminates that node from consideration by this routine. 


CALLING SEQUENCE: 


JSB CNX$FAILO_SYNC 
IPL is IPL$_SCS 


INPUT PARAMETERS: 


OOOOCOCCooooooooooo 
COOOCCCOCOCCOCOCOCooooo 


CNXSFAILO SYNC:: ; 
BBS #CLUF CBSV_SYNC_N F. - ; Branch if this is the 


03 20 a5 02 €0 OD 
GLUFCBSL_STATUS(R5 


: synchronizing node 


20 
5 
: 
6 RS: Address of Failover Control Block 
27 R4: Address of CLUB 
3 R3: Failover ID 
9 OUTPUT PARAMETERS: 
¢ NONE 
¢ COMPLETION CODES: 
0 § NONE 
6 3 SIDE EFFECTS: 
40 NONE 
re 
23 
44 
t2 
a 
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0098 31 8 ok BRwW 100$ ; Branch to code for other nodes 
29 60 A4 36 C £29 5$: MOVZWL CLUBSW_LOCAL_CSID_IDX(R4),RO ; Local CSID index field 
00 48 AS 50 €E C fe BBCS 7 : Mark local node responded 
D 7 § FL UF COSB_RESPRAP(AS) .78 
0B si : 734 7$: BRB 4 ; Branch to base of loop 
20 aS 6©10~=Co C8 ee 755 10$: BISL2 #CLUFCBSM_WAITING, - ; Set waiting bit 
00g 136 CLUFCBSL_STATUS(R5) 
0103 30 D 75 BSBW DELAY ; Wait for 0-1 second 
20 AS 610 CA 00D8 £38 BICL2 #CLUFCBSM_WAITING, - ; Clear waiting bit 
gor 75 CLUE COet STATUS(R5) 
7 86 6©flhlU DD DF 760 14$: MOVL #CLUFCB$S_RESPMAP=-1,R1 ; Byte counter 
50 28 A541 48 A541 8B pee 761 15$: B1cB3 CLUFCBSB_RESPMAP(RDSER11, - ; See if all currently involved 
a5e 166 ELF CESS _JRDERAP CAD» R11], - ; nodes have responded 
gS 12 BRee 764 BNEQ 10$ ; Found unresponded node(s) 
F351 F4& QOEC 765 SOBGEQ R1,15$ ; Iterate over all nodes 
55 dD ta £68 volts R5 ; Save register 
48 AS 20 00 6€ 00 e2 Bee) yor MOVC pe ey ae 3; Zero out RESPMAP for next use 
OOF 769 CLUF CB$B_RESPMAP(RS) 
20 =BA aye ory POPR #*M<R5> ; Restore register 
OOFA ure ; ALL reponses have been received. Send a message to every 
; node telling it to proceed with the next step. 
od. ae d Lling i d with th 
54 OOFF 8F 3C¢ OOFA 775 MOVZWL #<CLUFCBSS_NODEMAP*8>-1,R4 ; Index in bitmap 
09 = =11 gore ae BRB 30$ ; Branch to loop entrance 
0103 o78 19$: BUG_ CHECK CNXMGRERR,FATAL ; Consistency check 
30 y 105 780 20$: POPR #*M<R4,RS> ; Restore index and CLUFCB addresses 
0004 0 i 44 BSBW DELAY > Wait 0-1 seconds 
04 28 AS 54 £0 O10A 7 é 30$: BBS R4 , CLUF.CBSB_NODEMAP(RS) , 508 : Branch if in map 
F854 F4& O10F 783 40$: SOBGEQ R4,30$ 3; Iterate over all possible bits 
05 o118 oe RSB ; ALL done, return to caller 
51 Q0000000'GF DO 011 re 50$: MOVL GCL USGL CLUSVEC Rt ; Address of cluster vector 
5 6144 » Rie 787 MOVL (R1)CR4I]—R ; Address of CSB | 
1 11 oes BGEQ 19$ ; Not a CSB -- consistency check 
FA 60 A3 18 €0 O1 789 BBS #CSBSV_LOCAL, - ; Branch if local node to avoid 
1 790 CSBSL_STATUS(R3) ,40$ ; _ trying to send message to self 
30 1 ; 791 PUSHR #*M<RZ,R5> 3; Save context 
43% 1 85 BSBW CNXS$ALLOC_CDRP_ONLY ; Get a fork block 
FEDS" 30 O12A 79 BSBW CHXSRESOURCE_CRECK ; Watch resource availabilit 
550 €9 O12D 794 BLBC RO,20 ; Branch if no memory available 
5 AE DO 0130 795 MOVL  4(§P),R4 : Restore CLUFCB address 
2c AS OC QA : 4 a MOVZBL Na ant HegpOster. - ; Function code for message 
0093 Hi 1 H 79 BSBW 00$ ~ ; Initialize CDRP for mesage but ding 
FEC2' 0 0138 79 BSBW CNXSSEND_FORGET ; Send message to remote node and return imm 
BA 01 0 POPR #*M<R4,R5> 3; Restore index and CLUFCB addresses 
BC 20 AS E2 014 1 BBSS  #CLUFCBSV_FKB_BUSY, -_ ;: Branch if busy and mark busy 
14 ; CLUFCBS$L_STATOS(R55,19$ 
0B AS 08 90 Q14 MOVB giPL Sot = = 3; Store IPL in fork block 
149 4 FKBSB_FIPL(RS) 
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08 BA 0149 5 POPR #*M<R3> ; Save caller's address 
Q0000000'GF 16 014 § JSB G*EXESF ORK ; Release control momentarily 
; DD 01 PUSH ; Restore caller's address 
20 AS CA Q1 § BICL #CLUFCBSM_FKB_ BUSY, - ; Clear busy bit 
1 0 LUF CBS STATOS (RSS 
B3 20 AS 01 #4€1 0157 10 BBC thet V_PENDING, = ; Branch if no failover is 
15¢ 811 CLUFCBSL_STATUS(R5),40$ : pending 
05 O15C 1 RSB ; Return - new failover is pending 
130 Bla 
15D 15 ; This concludes the code that executes in the synchronizing node. 
13 1§ ; Here begins the code that executes in the non=-synchronizing nodes. 
90 BA 0Q15D 18 208: POPR #*M<R5> ; Restore CLUFCB addresses 
dD =610 FA i 95$: BSBB DELAY 3; Wait 0-1 seconds 
o161 1 ; Send message to synchronizing node telling it that we are ready for 
191 : § 3; a command to finish a failover step. 
53 2405 00 5181 824 100$:  MOVL CLUFCBSL_SYNC_CSB(R5) ,R3 ;_CSB address of synchronizing node 
55 DD 0165 825 PUSHL R ; Save context 
434% $ 0167 8 § BSBW CNXSALLOC_WARMCDRP_CSB ; Get a fork block 
FE9S' 30 QO16A 8 BSBW CNXSRESOURCE_CHECK ; Check for pool exhaustion 
ED 50 €9 0160 828 BLBC RO,90$ ; Branch if no memory available 
10 BA 8128 86? POPR #*AcRd> ; Restore CLUFCB address into R4 
40 AS 54 D0 17 830 MOVL R4, CORPSL_VAL6(R5) ; Save CLUFCB address 
44 AS BEDO 0176 831 POPL  CDRP$L_VAC7(R5) : Completion address 
2C AS 0B 9A OI7A B36 MOVZBL #CLMCNRSK_FNC_READY, - ; Function code for message 
O17E 83 DRPSL_VAC1(R5) 
4E 10 E43 834 BSBB ; Set up CDRP for message | 
FE7D" 30 gig HH BSBW CNXSSEND_MSG_CSB ; Send message to remote node 
018 837 ; Resume here when the response message arrives or the connection 
BIg Ht 3 breaks. 
01 839 ; Registers contain: 
018 840 ; RO: Status 
Hy 841 ; Re: Address of message buffer 
1 oes : R3: Address of CSB 
018 se ; R4: Address of PDT 
Bt rf: 3 RS: Address of CORP 
44 AS DD 4 £6 PUSHL CDRPSL_VAL7(R5) ; Completion address 
30 AS DD 0186 4 PUSHL CDRPSL_VAL2(R5) ; Failover sequence number 
40 aS 3 189 rk PUSHL CORPS$L_VAL6(R5) ; CLUFCB address 
FE71° 18C 4 BSBW CNXSPROCESS_RESPONSE ; Standard response processing 
eo BA 018F 50 POPR #*M<R5> ; CLUFCB address 
1C AS 3 D1 191 51 CMPL (SP)+,CLUFCBSL_ID(R5) 3; Same step? 
¢ \¢ 138 2¢ BNEQ 110$ ; Abandon this failover 
01 D 19 CMPL t! #1 ; Test return status 
C3 1F O19A 854 BLssu 95 : Branch on NAK 
1c =#6«1A 136 2? BGT 1108 ; Connection is broken 
3 2 3; ACK received 
27 20 AS O03 €2 Q19 35 BBSS #CLUFCBSV_FKB_ BUSY, - _; Branch if busy and mark busy 
1A 60 CLUFCB$L_STATOS(R55,140$ 
20 A5 10 C8 OIA 61 BISL2 M#CLUFCBSA_WAITING, - ; Set waiting bit, error if 
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v04-000 CNXSFAILO_ SYNC 1 Inter- “Rode Synchronizat "g° SEp=] 984 Be :09s47 SYSLOA.SRCJFALMAN.MAR; 1 . 3) 
1A7 ¢ CLUFCB SL STATUS(R5) ; already set 
53 6€ »DO OQI1A7 MOVL ; Save return PC 
1AA 4 ASSUME FKBSL_FR4,£Q,<FKBSL_FR3+4> 
19 AS 7D O1AA 65 MOVQ R3,FKBSL_FR3(R5) ; Save R3 and R4 
0c BEA? 9 1A g MOVAB Br {20s FRBSL_FPC(RS) : Restart PC 
65 23 dO 018 6 MOVL  R5,(R5$ ; Link to self 
04 A 5 OD 1B6 68 MOVL R5,4(R5) 
5E . + 4 1108: soos #4,SP : Forget return address 
1BE 71 
os a0ns 8 BB HBL HE BE Be rcasy sua ousy, « tsset Sasa itagt Seeres 
~ ear bus a 
15 B74 CLUFCBSL STATOS(RSS 1408" sieblacne 
20 AS 10 CA 1¢5 75 BICL2 #CLUFCBSA_w ING ; Clear waiting bit 
0109 876 CLUF CBSL "STATUSCRS) 
OF OER Bre ae 
gree t 140$: BUG_CHECK CNXMGRERR,FATAL ; Consistency check 
1CE 881; 
ab +o ; Initialize CDRP for building messages 
O1CE 884: RS: Address of CDRP 
Bite oe? i 3 R4: Address of CLUFCB 
4C AS O4'AF QE BicG Ht $008: MOVAB Bete Stee SG. ; Message build routine address 
30 AS 1C A4~ DO g1Ds +34 MOVL ELUFEBBL DCRE), ; Failover sequence number 
34 AS 18 AS DO O01D8 = 891 MOVL CLUFCBSL_STEP(R4), - ; Failover routine index 
0s 2108 a36 ae CDORP$L_VAL3(R5) 
Q1DE 894 
01DE so 3 
01DE 96 ; Wait 0-1 second on fork and wait queue 
Bie 444 ; If a new failover is requested, return to caller's caller 
O1DE $38 : R5: Address of CLUFCB fork block 
O1DE it ; RO“ RG are destroyed. 
1DE 902 BELAY: 
1D 20 AS «603)—S—tsC@E? Bie $08 BBSS oth Ue cosy Pua Busy Ys 508 ; Branch if busy and mark busy 
0B AS 08 90 IE 305 MOVB #IPLs_ SCS 5) "" Store IPL in fork block 
10 BA OQI1E7 307 POPR #*M<RE> ; Save return address 
08 BA O1E9 9 #*M<R3> ; Save caller's caller's address 
1EB 309 FORK_WAIT 3; Wait and hope some responses appear 
5 DD O1F1 910 PUSHE R3 ; Restore caller 5 caller's address 
08 20 A5 33 E5 iF $e BBCC Mae aay ree eyass 2 ,208 ; Clear busy bit 
1 S #CLUFCBSV_PENDING ; Branch if another failover is 
Seem ifD HF os CLUFCBSL “STATUS (RS) , 10$ ; _ pending and do it 
1FD 15 PUSHL R4 ; Restore return ee 
1FF 318 10$: RSB 
$6 918 208: BUG_CHECK CNXMGRERR,FATAL ; Consistency check 
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-SBTTL BLD_STEP_MSG = Build Message to Step Failover 


+ 
+ 


FUNCTIONAL DESCRIPTION: 


This routine builds the messages used to sequence failover steps 
from data in the CORP. 


CALLING SEQUENCE: 


JSB BLD_STEP_MSG 
IPL is IPL$_SCS 


INPUT PARAMETERS: 


ae Address of message buffer 
R3: Address of CSB 
RS: 


Address of CDRP 
RP$L_VAL1(R5): Byte 0 contains facility specific function code 
CDRPS$L_VAL2(R5): Failover sequence number 
CORPSL_VAL5(R5): Failover step index 
OUTPUT PARAMETERS: 
NONE 
COMPLETION CODES: 
NONE 


SIDE EFFECTS: 
RO and R1 are destroyed 


BLD_STEP_MSG: 
Aove 


08 A2 O01 90 #CLSMSGSK_FAC_CNX, - ; Facility code 
CLSMSG$B FACILITY (R2) 
09 A2 2C AS 90 MOVB CDORP$L_VAL1(R5), - : Facility specific function code 
CLSMSGSB FUNC (RE) 
0c A2 30 A5 00 MOVL CORP SL OVAL cna) - ; Failover sequence number 
10 A2 34 A5 00 MOVL CORPSL SRL SCR ,- ; Failover step index 
CLMSTPSL_STEP(R2) 
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-SBTTL CNXSRCVD_READY = Ready for Failover Step Message Received 


++ 


FUNCTIONAL DESCRIPTION: 


a failover step is received. 
CALLING SEQUENCE: 


JSB CNX$RCVD_READY 
IPL is IPL$_SCS 


INPUT PARAMETERS: 
a Message address 
R3: CSB of sending system 
R4: PDT address 
R5 

; OUTPUT PARAMETERS: 

NONE 

; COMPLETION CODES: 
NONE 

SIDE EFFECTS: 
RO-R5S may be destroyed. 


CNXSRCVD_READY:: 
USHL 


This routine is called when a message from a node ready to perform 


CORP address (uninitialized) 


B R ; Message buffer address 

BSBW CNXSINIT_CDRP ; Initialize the CORP for the response 

POPR #*M<R2> 3; Restore message buffer address 

MOVL CSB$L_CLUB(R3) ,R4 3; Address o 

MOVAB CLUBS$B_CLUFCB(R4) .R4 ; Address of failover block 

CLRL RO ; Assume failure 

CMPL CLMSTPSL_ID(R2), = 3 Is the expected failover in progress? 
CLUF CBSL_ID(R4) 

BNEQ $ ; Branch if not and ignore message 

CMPL CLMSTPSL_STEP(R2), - ; Has this step been passed? 
LVF COT. _STEPCAS? 

BLSSU $ :; Branch if yes =~ something is yrong 

BBC #CLUFCBSV_SYNC_NODE, - ; Branch if this is not the synchronizing 
CLUFCBSL_STATUS(R4),90$ ; node 


MOVL -#1,R ; 
MOVZUL CSBSW_CSID_IDX(R3),R1—; 


MOVL  CSBSL EL.UB(R3) R4 ; 
MOVZBL #CLMCRXS$K_FNC_READY,RO : 
BSBW  CNXSINIT_STD_RESP : 


Set success 
CSID index of sending system 
et the response bit corresponding 
to the sending node 
Save status and CLUFCB address 
Address of CLUB 
Message | 
Set up response message 


Page 45, 
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36 AS 6 29 3 1 MOVB = (SP) DRPSL L AVAL3+2(R5) ; Store status flag 
FDAG' 1 BSBW CNXSRES ; Send response an 
1 BA A POPR SRS” : Restore gtetus oad “ELF ce address 
OF 50 —s«E9 ; 1 BLBC ; Branch if message NAKed 
OA 20 AS 4 €1 1 BBC #CLUFCBSV_WAITING ; Branch if not waiting for responses 
CLUF CBSL “STATUS(RS) , 30$ 
: : Byechrentsing node waiting for nodes to respond. 
; up. 
66 1035 : 
55 65 OF 0266 1 : REMQUE (RS),RS5 Dequeue fork block 
6 1 ASSUME FKBSL _FR4,EQ,<FKBSL_ FR3+h> 
53 10 a5 7D 1 8 MOVa “FR3(R5S,R3 : Restore R3 and R4 
0c 85 «16 $0 1 JSB ores C_FPC(RSS t Call fork routine 
05 oy ! re 30$: RSB 3; Return 
71 +1042 90$ BUG_CHECK CNXMGRERR,FATAL ; Consistency check 
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~SBTTL CNXSRCVD_DOSTEP = Do Failover Step Message Received 


> 
+ 


FUNCTIONAL DESCRIPTION: 


This routine is called when a message from the synchronizing node is 
received requesting that a failover step be performed. 


CALLING SEQUENCE: 


JSB CNXSRCVD_DOSTEP 
IPL is IPL$_SCS 


INPUT PARAMETERS: 
4 f Resse € address 
Wai 
OUTPUT PARAMETERS: 
NONE 
COMPLETION CODES: 
NONE 
SIDE EFFECTS: 
RO-R5 may be destroyed. 


CNXSRCVD_DOSTEP:: 
PUSHL $L_ID(R2) 


CLMSTPSL_ 
PUSHL CLMSTPS$L~STEP(R2) 
BSBW  CNXSDEALC mSG BUF _CSB 
MOVL  CSBS$L_CLUB(R3) 

MOVAB CLUBS$B_CLUFCB(R4) RS 
POPR #*M<R 


Failover sequence number 
Failover step index 
Deallocate message ouffer 
Address of CL 

Address of failover block 


oP R1> SEQ to R1, INDEX to RO 

CMPL ——R1, CLUFCB$L_ID(R5) Is the expected failover in progress? 
BNEQ § Branch if not and ignore message 

CMPL RO, CLUFCBSL_STEP(RS) Check step number consistenc 


Branch if different -- something is wrong 
Branch if this is the synchronizing 


BBS #CLUFCBSV_SYNC_NODE, - 
node -- can't hap 


CLUFCBSL_STATUS(RS) .90$ 
BBC #CLUFCB ING, - 

CLUFCBSL_STATUS(RS5) ,90$ 
EQ,<FKBSL_FR3+4> 


ASSUME FKBSL_FRS 

MOVa oak FR3(R55,R 3; Resume fork block 

JSB aFKBSC_FPC(RSS ; _ and continue table 
20$: RSB 3; Return 


90$: BUG_CHECK CNXMGRERR,FATAL ; Consistency check 


appen | 
Branch if this node not waiting for respon 
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rf 1098 ~SBTTL CNXSFAILO_JNL = Initiate journal recovery 
AB 11 ° p++ 
AB 11 § 3 
4 1} Z 3; FUNCTIONAL DESCRIPTION: 
8 ~ 1 5; This routine find and invokes the journal recovery routine. 
" i : 3 CALLING SEQUENCE: 
AB 11 § 3 JSB CNXSFAILO_JNL 
AB 1319 3 Pe is IPL$_SCS 
AB 1111; 
m4 1116 ; INPUT PARAMETERS: 
8 AB 1114; R5: Address of Failover Control Block 
O2AB 1115; R4: Address of CLUB 
02A8 1138 : R3: Failover ID 
Q2AB 1117; 
O2AB 1118 ; OUTPUT PARAMETERS: 
Q2AB 1119; 
Q2AB 1120; NONE 
Q2AB 1121; 
8 AB 11 g ; COMPLETION CODES: 
AB 11 3 
O2AB 1124; NONE 
O2AB 1125 ; 
O2AB 1126 ; SIDE EFFECTS: 
QO2AB 1127 ; 
O2AB 1128 ; NONE 
Q2AB 1129 ; 
02AB 1130 ;-- 
8 AB 1131 
AB 11 § CNXSFAILO_JNL:: 
50 18 A4 09 O2AB 11 MOVL CLUBSL_JNL_FAIL(R4),RO ; Fetch and test routine address 
02 13 O2AF 1134 BEQL 10$ 3; No routine found, complete failover step 
60 17 ps8 1135 JMP (RO) ; Jump to routine -- it sees standard 
3 B3 1136 : failover routine environment 
$B 1137 
05 0283 1138 10$: RSB 
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+4 ~SBTTL CNXSCHECK_FAILOVER = Test for and begin pending failover 
B4 +e 


FUNCTIONAL DESCRIPTION: 
This routine tests for a pending failover and initiates the pending 
failover. If no failover is pending, a check is made for a certain 
future failover. It is is known that a failover will soon happen, the 
thread is dropped (return to caller's caller). If no failover is 
og Ny and no future failover is certain, return to the caller. 

t is assumed that failover table processing is in NOREFAIL mode. 

This routine is used by failover routines that operate in NOREFAIL 
mode to determine ether or not to continue failover processing or 
to abandon it in favor of starting over (immediately or in the near 
future). If a decision is made to abandon processing, no return 
is made to the caller. 

CALLING SEQUENCE: 
JSB CNXSCHECK_FAILOVER 
IPL is IPL$_SCS 
4(SP) : Address of caller's caller 

INPUT PARAMETERS: 
NONE 

OUTPUT PARAMETERS: 
NONE 

COMPLETION CODES: 
NONE 

SIDE EFFECTS: 


If return is to caller, RO and R1 are destroyed. 
If return is to caller's caller, RO-R5 are destroyed. 
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CNXS$CHECK_FAILOVER:: 
MOVL G*CLUSGL_CLUB,RO : Address of CLUB 
BBC #CLUFCBSV ACTIVE, - : Branch if in REFAIL mode and 
UBS$B_CLOFCB+CLUFCBSL_STATUS(RO), - 


; _ bugcheck 
BBC #CLUFCBSV_PENDING 


50. 00000000'GF 00 
1F O12C CO) 3=—f00tsé#€&* 


¢ 
NG, = ; Branch if no failover is pending 
LUBSB_CLOFCB+CLUF CBSL_STATUS(ROD , - 


BICL2 #CLUFCBSM ACTIVE, - ; Clear failover active bit 
CLUB$B_CLOFCB+CLUFCBSL_STATUS (RO) 
at 


0B 012c CO. «C01tsé#€’T 


012C cO)601Sfs«CCA 


SE 04 0 
FD2E 1 


08 1C AO 17 ~«€1 


‘ 3; Remove caller's return address 
CNXSMEMBERSHIP_CHANGE ; Initiate new membership change 


BBC #CLUBSV_LOST_CNX, - ; Branch if no cluster connections broken 
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v04-000 CNESCREEK FAILOVER - Test for and begin g-3EF =1382 89: 38:23 :47 ([{SYSLOA.S REI ALMAN.MAR; 1 . (i) 
D7 1197 CLUBSL FLAGS (RO) .20$ 
012C CO.) CUéité‘CKAR D7 11 8 BICL2 ecu ue cosh M_ACTIVE, = Tus th failover active bit 
pc 1 CLU 5 CLOPCB+CLUFCBSL.. staTus«RO) 
SE 04 £0 oc 1 y ADDL2 # ues ; Return to caller's caller 
OF | 8 208 RSB ; No pending failover, return 
ED | 08 30$: BUG_CHECK CNXMGRERR,FATAL ; Consistency check 
8 E4 1205 .END 
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Symbol table ° $78 SEP=1984 80: 36:23 247 (CSYSLOA.SRCJFALMAN.MAR; 1 ° By vO! 
BLD_STEP_MSG 00000204 R. CNX$I10_SYNCH 
BUGS CNXAGRERR eeeeeree 8 CNXSMEMBERSHIP_CHANGE She 
RPSL_MSGBLD = 4C CNXSPROCESS_RESPONSE aaccears 
CDRPSC 1 = f CNXSRCVD_DOSTEP 90000275 | 
CORPSL-VAL = CNXSRCVD- READY 000218 
CORPS$L_VAL = 8 4 CNXSRESOORCE CHECK teeerene a ; 
CORPSL_VAL = 040 CNXSRESP_FORGET eeneenee =X 
CDRPSL-V = 00000044 CNX$SCAN-CSBS eeneeeee =X 03 
CLMCNXSK_FNC_DOSTEP = 9000000¢ CNXSSEND~ FORGET teeters KX 03 | 
CLMCNXSK_FNC_READY = 00000008 CNXSSEND-MSG_CSB tereeeee KX 03 
CLR TPE STEP = 90090010 ESBSt STATUS : a9000060 | 
= = 
SOASCEB-PAEILITY = $0000008 ESBSUnLOCAL = $000001 
CLSMSG C = 00000009 CSBS$V_LONG BREAK = 00000000 
CLSMSGSK"FAC_CNX = 00000001 CSBS$V_ ER = 00000001 
CLUSGL_CCuB ~ tererees CSB$W_CSID_IDx = 0000004C 
CLUSGL— CLUSVEC rteereee DELAY 000001DE 03 
CLUB$B~CLUFCB = 0000010¢ EXESCLUTRANIO eeeeeeee KX 03 
$L_C = 0000005C EXESFORK eeneeeee =X | (03 
CLUBSL_FLAGS = 0000001C EXESFORK_WAIT eerereee KX 03 
CLUBSL_JNL_FAIL = 444d FKBSB_FIPL = 00000008 
CLUBS$L_LOCAL_CSB = 0000001 kes. ree = 0000000C 
CLUBSL_LST_XT = 00000034 FKBSL_FR = 00000010 
CLUBSV_LOST_C = 00000017 FKBSL_FR4 = 00000014 
CLUBSW-LOCAC_CSID_IDX = 00000060 FORCE_END = 00000027 
CLUF CBS$B_NODE = 4 4444 IPL$_SCS = 00000008 
CLUF CB$B_RESPMAP = 0000004 He Hd = 00000008 
CLUF CBS$L_ID = 0000001C IPL$_TIMER = 00000008 
CLUF CBSL_ STATUS = 00000020 LCKSINIT_REBUILD eeenenee xX o¢ 
CLUF CBSL_STEP = 00000018 LCKSREBUTLD_LKBS teeeeeeer = X 0 
CLUFCBSL_SYNC_CSB = et LCKSREBUILD-RSBS eeeeeeee = X 02 
CLUFCBSM_ACTIY = 00000001 LCKSRESUME_ALL eererere KX 0 
CLUF CBSM_FKB_BUSY = 00000008 LCKSRESUME xyNBROT eeeeeeee = X 0 
CLUF CBSM_PENDING = st 002 LCKSSET_S STATE teeneeee X 0 
LUF CBSM_SYNC_NOD = 00000004 LCKSSET"STA Tt eeeeeene x 9 
CLUF CBSM_WAITING = 00000010 LCKSSET_STAT uy eeereree =X 
CLUF CB$S_NODE = 00000020 LCKSSET- STATE teeeeeee xX 0 
CLUF CB$S_RE Pp = B28 0020 LCKSSTACL_ALL eeeeeeee =X 02 
CLUF CBSV_ACTIVE = 38 00 ° 
CLUF CB$V"FKB_BUSY = 000000 
CLUF CBSV~PENDING = 00000001 
L $V—SYNC_NODE = 00000002 
CLUE CBSV WAITING = 00000004 
CNX LOt_ CDRP keterere x 
CNXSALLOC “GARMEDRP CSB eeeereee § xX 
CNXSCHECK-FAILOVER™ 00000284 RG 
XSDEALL-MSG BUF _CSB tereeeee x ; 
CNXSEND_FAILOVER 00000059 RG 
ENXSFAILOVER TABLE 0 09 RG 
CNX$FAILO_END RG 
CNX$FAILO™ JNL AB RG 
CNX$FAILO_NOP 6 RG 
CNX$FAILO_NOREFAIL A4 RG 
CNXSFAILO_REFAIL OF RG 
FAILO C1 RG 
CNXSINIT_CDRP teneeeee 8 6X 
CNXSINIT— STD RESP eeeeeeee §6X 
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PSECT name Allocation PSECT No. Attributes 
apts - 0000000 0.) 00 ( 0.) NOPIC USR CON ABS LCL NOSHR NOEXE NORD NOWRT NOVEC BYTE 
358060 Googoacy < 196°) a 33) Wels Uae Ea REL LCL NOSHR EXE RD WRT NOVEC LONG 
$$$100 $000 ES ¢ 128:) 08 ( $:} NOPIC USR CON REL LCL NOSHR EXE RD WRT NOVEC LONG 
peewee mene ween nee eennnnnt 
éhemiennnemneml 
Phase Page faults CPU Time 
Initialization 29 90:00:00.04  00:00:01.06- 


ong be processing 


s 1 
pees table sort 
Pass 2 2 
Symbol table output 
Psect synopsis output 
Cross-reference output 
Assembler run totals 


~“N 
WwW 
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The working set Limit was 1650 pages. 

82112 bytes (161 pages) of virtual memory were used to buffer the intermediate cod 

{rere were 80 pages of symbol table space allocated to hold 1230 non-local and 58 locat symbols. 
iz? source Lines were read in Pass 1 "aie 19 object records in Pass 2 

18 pages of virtual memory were used to define 17 macros. 
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Macro Library name Macros defined 
"$255$DUA2 8:CSYSLOA.OBJJCLUSTER.MLB; 1 1 
*$255$DUA28:CSYS.OBJJLIB.MLB; 1 7 
“$255$DUAZ8: eySi IBISTARLET ALB; 2 3 

TOTALS (all Libraries) 1 


1289 GETS were required to define 13 macros. 
There were no errors, warnings or information messages. 
MACRO/LIS=L1IS$:F ALMAN/OBJ=OBJ$:FALMAN MSRC$:FALMAN/UPDATE=(ENHS:f ALMAN) ¢EXECML$/LIB+LIB$:CLUSTER/LIB 
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